Combined Image and Text Document

ABSTRACT

A combined image and text document is described. In embodiment(s), a scanned image of a document can be generated utilizing a scanning application, and text representations of text that is included in the document can be generated utilizing a character recognition application. Position data of the text representations can be correlated with locations of corresponding text in the scanned image of the document. The scanned image can then be rendered for display overlaid with the text representations as a transparent overlay, where the scanned image and the text representations are independently user-selectable for display. A user-selectable input can be received to display the text representations without the scanned image, the scanned image without the text representations, or to display the text representations adjacent the scanned image.

BACKGROUND

A printed or other paper document can be scanned as an image and saved as an electronic document. A scanned image of a document provides an accurate visual representation of the document's content, such as text of the document, handwritten notes, a written signature, a watermark, and/or other visual markings. However, the text of a document in a scanned image cannot be searched for, selected, and/or copied.

Optical character recognition (OCR) technology is typically utilized to convert document text from an original document, or document text in a scanned image, into an editable text representation that enables the text to be searched, selected, and/or copied. However, a text representation of document text is typically generated as a text file that does not retain the visual appearance or representation of an original document. For example, original document features such as the formatting, layout, and artifacts of the original document, as well as handwritten notes, a written signature, a watermark, and/or other visual markings are not retained when a text file of a document is generated.

SUMMARY

This summary is provided to introduce simplified concepts of a combined image and text document. The simplified concepts are further described below in the Detailed Description. This summary is not intended to identify essential features of the claimed subject matter, nor is it intended for use in determining the scope of the claimed subject matter.

A combined image and text document is described. In embodiments, a scanned image of a document can be generated utilizing a scanning application, and text representations of text that is included in the document can be generated utilizing a character recognition application. Position data of the text representations can be correlated with locations of corresponding text in the scanned image of the document. The scanned image can then be rendered for display overlaid with the text representations as a transparent overlay, where the scanned image and the text representations are independently user-selectable for display. A user-selectable input can be received to display the text representations without the scanned image, the scanned image without the text representations, or to display the text representations adjacent the scanned image.

In other embodiments of a combined image and text document, a markup document can be generated that includes the text representations, associated display alignment information, and a reference to the scanned image for display. Selected text can be searched from the text representations, and the transparent overlay of the selected text can be highlighted to display an appearance that the corresponding text in the scanned image is highlighted. In an alternate embodiment, the text representations can be rendered as a visual overlay for display over the scanned image to illustrate a comparison of the text representations with the corresponding text in the scanned image that reflects an accuracy of the character recognition application. Additionally, the scanned image can be displayed in a first color overlaid with the text representations displayed in a second color that contrasts with the first color to further illustrate the comparison of the text representations with the corresponding text in the scanned image.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of a combined image and text document are described with reference to the following drawings. The same numbers are used throughout the drawings to reference like features and components:

FIG. 1 illustrates an example system in which embodiments of a combined image and text document can be implemented.

FIG. 2 illustrates an example combined image and text document in accordance with one or more embodiments.

FIG. 3 illustrates example method(s) for a combined image and text document in accordance with one or more embodiments.

FIG. 4 illustrates example method(s) for a combined image and text document in accordance with one or more embodiments.

FIG. 5 illustrates various components of an example device that can implement embodiments of a combined image and text document.

DETAILED DESCRIPTION

Embodiments of a combined image and text document provide that a document can be scanned to generate a scanned image of the document. Text representations of text that is included in the document can then be generated from the original document or from the scanned image. A markup document can be generated that includes the text representations of the text that is included in the original document, associated display alignment information, and a reference to the scanned image. The scanned image can then be displayed and overlaid with the text representations as a transparent overlay of the scanned image. In an embodiment, selected text can be searched for in the text representations and/or searched for in the markup document, and the transparent overlay of the selected text can be highlighted to display an appearance that the corresponding text in the scanned image is highlighted.

The scanned image and the text representations can also be independently user-selectable for display. For example, a user can initiate a user input to display the text representations adjacent the scanned image on a display so that both the scanned image and the text representations are viewable side-by-side. Alternatively, a user can select to display just the text representations without the scanned image, or just the scanned image without the text representations.

While features and concepts of the described systems and methods for a combined image and text document can be implemented in any number of different environments, systems, and/or various configurations, embodiments of a combined image and text document are described in the context of the following example systems and environments.

FIG. 1 illustrates an example system 100 in which various embodiments of a combined image and text document can be implemented. Example system 100 is a document representation system that includes an example document 102, a device 104 that can be implemented to process the document, and a display device 106. The example document 102 includes text 108, a handwritten signature 110, a handwritten note 112, and an image 114. An example document may also include a watermark, page number, multiple pages, and/or other visual markings.

Device 104 can be implemented as a computer device that includes document processing features to scan and copy a document. Alternatively or in addition, device 104 can be connected to a multi-function device or independent devices that scan, copy, and/or otherwise manage and process documents. Device 104 can be implemented with processors, communication components, data inputs, memory components, processing and control circuits, and a content rendering system. Device 104 can also be implemented with any number and combination of differing components as described with reference to the example device shown in FIG. 5. Device 104 may also be associated with a user (i.e., a person) and/or an entity that operates the device such that a device describes logical devices that include users, software, and/or a combination of devices.

In this example, device 104 includes one or more processors 116 (e.g., any of microprocessors, controllers, and the like) that process various computer-executable instructions to implement embodiments of a combined image and text document. Alternatively or in addition, device 104 can be implemented with any one or combination of hardware, firmware, or fixed logic circuitry that is implemented in connection with processing and control circuits. Although not shown, device 104 can include a system bus or data transfer system that couples the various components within the device, or to other devices. Device 104 can also include communication interfaces that can be implemented as a serial and/or parallel interface, a wireless interface, any type of network interface, a modem, and/or as any other type of communication interface. The communication interfaces provide connection and/or communication links between device 104 and a communication network (e.g., LAN, WLAN) and/or other document processing devices.

Device 104 can also include a device manager 118 (e.g., a control application, software application, signal processing and control module, code that is native to a particular device, a hardware abstraction layer for a particular device, etc.). Device 104 also includes a rendering system 120 that can be implemented to render scanned images, text representations, and/or a user interface for display at display device 106 to implement embodiments of a combined image and text document.

Device 104 also includes storage media 122 to store or otherwise maintain various data and information. The storage media 122 can be implemented as any type of memory, magnetic or optical disk storage, and/or other suitable electronic data storage. The storage media 122 can also be referred to or implemented as computer-readable media, such as one or more memory components, that provide data storage mechanisms to store various device applications 124 and any other types of information and/or data related to operational aspects of device 104. For example, an operating system and/or software modules and components can be maintained as computer applications with storage media 122 and executed on processors 116.

Device 104 includes a character recognition application 126, a scanning application 128, and a display application 130 that can each be implemented as computer-executable instructions and executed by processors 116 to implement embodiments of a combined image and text document. The character recognition application 126, scanning application 128, and display application 130 are examples of device applications 124 that can be maintained by the storage media 122. Although illustrated and described as applications of device 104, the device applications, as well as other functionality to implement the various embodiments described herein, can be provided as a service or as devices apart from device 104 (e.g., on a separate device or by a third party service).

The scanning application 128 can be implemented to scan documents and generate scanned images 132 that can be maintained with the storage media 122 at device 104. For example, the scanning application 128 can scan document 102 and generate a scanned image 134 of the document as shown on display device 106. A scanned image can be generated in any applicable file format and/or compression scheme, such as TIFF, FAX G3/G4, PNG, JPEG, MRC, XPS, PDF, and the like.
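
By way of illustration only, the following sketch shows how a captured page might be saved in two of the formats noted above. The Pillow imaging library, the file names, and the Group 4 compression choice are assumptions made for this sketch and are not part of the described scanning application 128.

# Sketch: save a captured page image in two of the formats noted above.
# Assumes the Pillow imaging library and a previously captured page image file.
from PIL import Image

page = Image.open("captured_page.png")       # placeholder for the scanner output
page.save("scanned_image.png")               # lossless PNG copy
page.convert("1").save("scanned_image.tif",  # bitonal TIFF with FAX G4 compression
                       compression="group4")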

The character recognition application 126 can be implemented to scan documents and generate text representations 136 of text that is included in the documents. For example, the character recognition application 126 can scan document 102 and generate text representations 138 of the text 108 that is included in document 102, as shown on display device 106. Text representations can be generated utilizing any applicable optical character recognition (OCR) technique or algorithm.
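
By way of illustration only, the following sketch shows one way that text representations with per-word position data could be generated from a scanned image. The use of the Tesseract OCR engine through the pytesseract binding, the file name, and the dictionary structure are assumptions made for this sketch and are not part of the described character recognition application 126.

# Sketch: generate text representations with per-word position data
# from a scanned image, using Tesseract OCR via the pytesseract binding.
from PIL import Image
import pytesseract

image = Image.open("scanned_image.png")
data = pytesseract.image_to_data(image, output_type=pytesseract.Output.DICT)

# Collect each recognized word with the location of the corresponding
# text in the scanned image.
text_representations = [
    {"text": data["text"][i],
     "left": data["left"][i], "top": data["top"][i],
     "width": data["width"][i], "height": data["height"][i]}
    for i in range(len(data["text"]))
    if data["text"][i].strip()
]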

In this example, the text representations 138 are displayed adjacent the scanned image 134 on display device 106. In embodiments, the scanned image 134 and the text representations 138 are independently user-selectable for display. The display application 130 can receive a user-selectable input at device 104 to initiate the display of the text representations 138 adjacent the scanned image 134 so that both the scanned image and the text representations are viewable side-by-side. Alternatively, the display application 130 can receive a user-selectable input at device 104 to initiate a display of the text representations 138 without the scanned image, or a display of the scanned image 134 without the text representations. The character recognition application 126 can also be implemented to correlate position data of the text representations 138 with locations of corresponding original text 108 in the scanned image 134 of the document 102 when generating the text representations for display.

The display application 130 can be implemented to coordinate with the rendering system 120 to manage the display of scanned images and text representations on display device 106. The display application 130 can also be implemented to generate a markup document 140 that includes the text representations 138, associated display alignment information, and a reference to the scanned image 134. The markup document 140 can be implemented as an XML-based markup, or in any other applicable format that describes a page for display on display device 106. In an embodiment, a markup document can include data that represents both the scanned image 134 and the text representations 138 for display. In an implementation, the markup document 140, and a document package containing the scanned image 134 and the markup document 140, can be generated directly on a scanning device and/or generated utilizing scanning application 128.

An example of the markup document 140 for the scanned image 134 and the text representations 138 in this example can include:

<FixedPage Width="836" Height="1024">
  <FixedPage.Resources>
    <ResourceDictionary>
      <ImageBrush x:Key="Image1"
        ImageSource="scanned image 134"
        Viewbox="0,0,718,260"
        Stretch="Fill"
        TileMode="None"
        ViewboxUnits="Absolute"
        ViewportUnits="Absolute"
        Viewport="91.5,79.5,359,130" />
    </ResourceDictionary>
  </FixedPage.Resources>
  <Path Fill="{StaticResource Image1}"
    Data="M 91.5,79.5 L 450.5,79.5 450.5,209.5 91.5,209.5 z" />
  <Glyphs UnicodeString="Document" OriginX="96" OriginY="96"
    FontRenderingEmSize="16" Fill="#00FFFFFF" FontUri="arial"
    xml:lang="en-US" ocr:Source="Image1" />
  <Glyphs UnicodeString="This document text" OriginX="96" OriginY="120"
    FontRenderingEmSize="12" Fill="#00FFFFFF" FontUri="arial"
    xml:lang="en-US" ocr:Source="Image1" />
  <Glyphs UnicodeString="includes details" OriginX="96" OriginY="135"
    FontRenderingEmSize="12" Fill="#00FFFFFF" FontUri="arial"
    xml:lang="en-US" ocr:Source="Image1" />
  <Glyphs UnicodeString="about this document" OriginX="96" OriginY="150"
    FontRenderingEmSize="12" Fill="#00FFFFFF" FontUri="arial"
    xml:lang="en-US" ocr:Source="Image1" />
</FixedPage>

This example markup document 140 includes a FixedPage Resources section that references scanned image 134 and associated display and alignment information. The example markup document 140 also includes a path element that renders the referenced scanned image, along with glyph elements. The glyph elements each include a UnicodeString attribute that is a line of text representation from the text in the original document 102. In an embodiment, a glyph element includes a Fill attribute (e.g., Fill="#00FFFFFF") that is a code for a transparent color value that defines a text element display as transparent characters. An unmodified viewer application (e.g., display application 130) can utilize existing functionality for text selection, search, copy, screen reading, and/or other functionality to process the transparent (e.g., invisible, non-viewable) text representations. An optional attribute can be attached to each transparent text element that indicates to a viewer application that the text was generated with an OCR technique from the underlying scanned image.
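
By way of illustration only, the following sketch shows how glyph elements with a transparent Fill value, in the style of the example markup document above, could be emitted once recognized text and position data are available. The helper name, the hard-coded font, and the plain string output are assumptions made for this sketch.

# Sketch: emit FixedPage-style glyph elements with a transparent fill,
# following the structure of the example markup document 140 above.
from xml.sax.saxutils import quoteattr

def glyphs_element(text, origin_x, origin_y, em_size=12):
    # Fill="#00FFFFFF" is a fully transparent color value, so the glyphs
    # remain selectable and searchable but are not visible over the image.
    return ('<Glyphs UnicodeString=%s OriginX="%d" OriginY="%d" '
            'FontRenderingEmSize="%d" Fill="#00FFFFFF" FontUri="arial" '
            'xml:lang="en-US" />' % (quoteattr(text), origin_x, origin_y, em_size))

lines = [("Document", 96, 96, 16),
         ("This document text", 96, 120, 12)]
markup = "\n".join(glyphs_element(*line) for line in lines)
print(markup)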

FIG. 2 illustrates an example 200 of a combined image and text document displayed on a display device in accordance with one or more embodiments. The example 200 illustrates a display of the scanned image 134 overlaid with the text representations 138 as a transparent overlay 202. In an embodiment, the display application 130 can be implemented to receive a text search request for selected text (e.g., a word or phrase), initiate a search for the selected text in the text representations 138, and then highlight the transparent overlay of the selected text to display an appearance that the corresponding text in the scanned image is highlighted. For example, the display application 130 can receive a text search request for the text “details” that is included in the original document 102, and initiate a search for the text in the text elements of the markup document 140. When the selected text is located, the text can be highlighted at 204 to display an appearance that the corresponding text in the scanned image 134 is highlighted at 206 when viewed through the transparent overlay 202 at viewing angle 208.

In this example, the text representations 138 are shown as being displayed offset from the scanned image 134, when in practice the text representations 138 are displayed over the scanned image 134 such that the text representations line up over the corresponding text in the scanned image 134. In addition, the text representations 138 are visible in this example of a transparent overlay for discussion purposes only. As described above, the text representations can also be transparent text characters displayed over the scanned image 134 in a transparent color so that a viewer would only see the content of the scanned image.
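
By way of illustration only, the following sketch shows the search-and-highlight behavior over recognized words and their correlated positions (for example, the structure produced in the earlier recognition sketch). The function name and the rectangle-based highlight regions are assumptions made for this sketch and are not part of the described display application 130.

# Sketch: locate selected text in the text representations and return the
# regions of the scanned image to highlight behind the transparent overlay.
def find_highlight_regions(text_representations, selected_text):
    query = selected_text.lower()
    regions = []
    for word in text_representations:
        if query in word["text"].lower():
            # The correlated position data gives the rectangle of the
            # corresponding text in the scanned image.
            regions.append((word["left"], word["top"],
                            word["left"] + word["width"],
                            word["top"] + word["height"]))
    return regions

# Example usage: highlight every occurrence of the word "details".
# regions = find_highlight_regions(text_representations, "details")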

In an alternate embodiment, the text representations 138 can be initiated for display as a visual overlay (rather than as a transparent overlay) for display over the scanned image 134 to illustrate a comparison of the text representations 138 with the corresponding text in the scanned image 134. A visible comparison of the text representations 138 with the scanned image 134 can serve to reflect an accuracy of the character recognition application 126 that is utilized to generate the text representations. A user can visualize and discern any differences or inaccuracies in the OCR technique utilized to generate the text representations 138.

In another embodiment, the scanned image 134 can be displayed in a first color overlaid with the text representations 138 displayed in a second color that contrasts with the first color to further illustrate a comparison of the text representations 138 with the corresponding text in the scanned image 134. Utilizing a semi-transparent fill color for the text representations 138, a user can directly verify which portions of the scanned image 134 were correctly recognized as text when the text representations 138 are generated.
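
By way of illustration only, the following sketch renders recognized words in a semi-transparent contrasting color over the scanned image so that recognition differences stand out. The Pillow drawing calls, the default font, and the fixed semi-transparent red fill are assumptions made for this sketch.

# Sketch: render the text representations in a semi-transparent contrasting
# color over the scanned image to visualize character recognition accuracy.
from PIL import Image, ImageDraw, ImageFont

base = Image.open("scanned_image.png").convert("RGBA")
overlay = Image.new("RGBA", base.size, (0, 0, 0, 0))
draw = ImageDraw.Draw(overlay)
font = ImageFont.load_default()

# Minimal placeholder; in practice, use the word positions produced by the
# recognition step (as in the earlier recognition sketch).
text_representations = [{"text": "Document", "left": 96, "top": 96}]

for word in text_representations:
    draw.text((word["left"], word["top"]), word["text"],
              fill=(255, 0, 0, 128), font=font)  # semi-transparent red

comparison = Image.alpha_composite(base, overlay)
comparison.save("comparison.png")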

Example methods 300 and 400 are described with reference to respective FIGS. 3 and 4 in accordance with one or more embodiments of a combined image and text document. Generally, any of the functions, methods, procedures, components, and modules described herein can be implemented using hardware, software, firmware, fixed logic circuitry, manual processing, or any combination thereof. A software implementation of a function, method, procedure, component, or module represents program code that performs specified tasks when executed on a computing-based processor. The example methods may be described in the general context of computer-executable instructions, which can include software, applications, routines, programs, objects, components, data structures, procedures, modules, functions, and the like.

The method(s) may also be practiced in a distributed computing environment where functions are performed by remote processing devices that are linked through a communication network. In a distributed computing environment, computer-executable instructions may be located in both local and remote computer storage media, including memory storage devices. Further, the features described herein are platform-independent such that the techniques may be implemented on a variety of computing platforms having a variety of processors.

FIG. 3 illustrates example method(s) 300 of a combined image and text document. The order in which the method is described is not intended to be construed as a limitation, and any number of the described method blocks can be combined in any order to implement the method, or an alternate method.

At block 302, a scanned image of a document is generated utilizing a scanning application. For example, scanning application 128 at device 104 (FIG. 1) generates a scanned image 134 of document 102. The scanned image 134 includes an image of the original document text 108, a handwritten signature 110, a handwritten note 112, and an image 114. At block 304, text representations of the text that is included in the document are generated utilizing a character recognition application. For example, character recognition application 126 at device 104 generates text representations 138 of text 108 that is included in document 102.

At block 306, position data of the text representations is correlated with locations of corresponding text in the scanned image of the document. For example, character recognition application 126 correlates position data of the text representations 138 with locations of corresponding original text 108 in the scanned image 134 of document 102 when generating the text representations.

At block 308, a markup document is generated that includes the text representations, associated display alignment information, and a reference to the scanned image for display. For example, display application 130 at device 104 generates the markup document 140 that includes the text representations 138, associated display alignment information, and a reference to the scanned image 134. In an embodiment, a markup document can include data that represents both the scanned image 134 and the text representations 138 for display.

At block 310, the scanned image is rendered for display overlaid with the text representations as a transparent overlay. For example, a display of the scanned image 134 is overlaid with the text representations 138 as a transparent overlay 202 (FIG. 2). In embodiments, the scanned image 134 and the text representations 138 are independently user-selectable for display. The display application 130 at device 104 can receive a user-selectable input to initiate the display of the text representations 138 adjacent the scanned image 134. Alternatively, the display application 130 can receive a user-selectable input to initiate a display of the text representations 138 without the scanned image, or a display of the scanned image 134 without the text representations.

At block 312, selected text is searched for in the text representations that are displayed in a transparent color over the scanned image and, at block 314, the transparent overlay of the selected text is highlighted to display an appearance that the corresponding text in the scanned image is highlighted. For example, display application 130 receives a text search request for the text “details” that is included in the original document 102, and initiates a search for the selected text in the text elements of the markup document 140. When the selected text is located, the text is highlighted at 204 to display an appearance that the corresponding text in the scanned image 134 is highlighted at 206 when viewed through the transparent overlay 202 at viewing angle 208.

FIG. 4 illustrates example method(s) 400 of a combined image and text document. The order in which the method is described is not intended to be construed as a limitation, and any number of the described method blocks can be combined in any order to implement the method, or an alternate method.

At block 402, a user-selectable input is received to initiate display of the text representations without the scanned image. For example, the scanned image 134 and the text representations 138 (FIG. 1) are independently user-selectable for display, such as by a user at device 104. The display application 130 receives a user-selectable input at device 104 to initiate a display of the text representations 138 without the scanned image 134. Alternatively at block 404, a user-selectable input is received to initiate a display of the scanned image without the text representations. For example, display application 130 receives a user-selectable input at device 104 to initiate a display of the scanned image 134 without the text representations 138. A user can toggle between displaying the scanned image 134 and the text representations 138 on display device 106.

At block 406, a user-selectable input is received to initiate display of the text representations adjacent the scanned image. For example, display application 130 receives a user-selectable input at device 104 to initiate a display of the text representations 138 adjacent the scanned image 134, such as shown on display device 106 (FIG. 1). The text representations 138 can be displayed adjacent the scanned image 134 on display device 106 so that both the scanned image and the text representations are viewable side-by-side.

At block 408, a user-selectable input is received to initiate display of the text representations as a visual overlay and, at block 410, the text representations are rendered as the visual overlay for display over the scanned image. For example, display application 130 receives a user-selectable input at device 104 to initiate a display of the text representations 138 as a visual overlay, rather than as a transparent overlay on the scanned image 134. The text representations 138 can be displayed as a visual overlay (e.g., the text characters are visible rather than transparent) to illustrate a comparison of the text representations 138 with the corresponding text in the scanned image 134 that reflects an accuracy of the character recognition application 126 that is utilized to generate the text representations. In an embodiment, the scanned image 134 is displayed in a first color overlaid with the text representations 138 displayed in a second color that contrasts with the first color to further illustrate the comparison of the text representations 138 with the corresponding text in the scanned image 134.

FIG. 5 illustrates various components of an example device 500 that can be implemented as any type of device, such as the device described with reference to FIG. 1, to implement embodiments of a combined image and text document. In various embodiments, device 500 can be implemented as any one or combination of a wired and/or wireless device, computer device, portable computer device, media device, communication device, document processing and/or rendering device, appliance device, electronic device, and/or as any other type of device that can be implemented to process and manage documents. Device 500 may also be associated with a user (i.e., a person) and/or an entity that operates the device such that a device describes logical devices that include users, software, firmware, and/or a combination of devices.

Device 500 includes wireless LAN (WLAN) components 502 that enable wireless communication of device content 504 or other data (e.g., received data, data that is being received, data scheduled for broadcast, data packets of the data, etc.). The device content 504 can include configuration settings of the device, scanned images, text representations, markup documents, and/or information associated with a user of the device. Device 500 can also include one or more data input(s) 506 via which any type of data or inputs can be received, such as user-selectable inputs and any other type of audio, video, and/or image data that can then be processed, rendered, and/or displayed for viewing.

Device 500 can also include communication interface(s) 508 that can be implemented as any one or more of a serial and/or parallel interface, a wireless interface, any type of network interface, a modem, and as any other type of communication interface. The communication interfaces 508 provide a connection and/or communication links between device 500 and a communication network by which other electronic, computing, and communication devices can communicate data with device 500.

Device 500 can include one or more processors 510 (e.g., any of microprocessors, controllers, and the like) which process various computer-executable instructions to control the operation of device 500 and to implement embodiments of a combined image and text document. Alternatively or in addition, device 500 can be implemented with any one or combination of hardware, firmware, or fixed logic circuitry that is implemented in connection with processing and control circuits which are generally identified at 512. Although not shown, device 500 can include a system bus or data transfer system that couples the various components within the device. A system bus can include any one or combination of different bus structures, such as a memory bus or memory controller, a peripheral bus, a universal serial bus, and/or a processor or local bus that utilizes any of a variety of bus architectures.

Device 500 can also include computer-readable media 514, such as one or more memory components, examples of which include random access memory (RAM), non-volatile memory (e.g., any one or more of a read-only memory (ROM), flash memory, EPROM, EEPROM, etc.), and a disk storage device. A disk storage device can include any type of magnetic or optical storage device, such as a hard disk drive, a recordable and/or rewriteable compact disc (CD), any type of a digital versatile disc (DVD), and the like. Device 500 may also include a mass storage media 516.

Computer-readable media 514 provides data storage mechanisms to store the device content 504, as well as various device applications 518 and any other types of information and/or data related to operational aspects of device 500. For example, an operating system 520 can be maintained as a computer application with the computer-readable media 514 and executed on the processors 510. The device applications 518 can also include a device manager 522, a character recognition application 524, a scanning application 526, and a display application 528. In this example, the device applications 518 are shown as software modules and/or computer applications that can implement various embodiments of a combined image and text document.

Device 500 can also include an audio, video, and/or image processing system 530 that provides audio data to an audio system 532 and/or provides video or image data to a display system 534. The audio system 532 and/or the display system 534 can include any devices or components that process, display, and/or otherwise render audio, video, and image data. The audio system 532 and/or the display system 534 can be implemented as integrated components of device 500. Alternatively, audio system 532 and/or the display system 534 can be implemented as external components to device 500. Video signals and audio signals can be communicated from device 500 to an audio device and/or to a display device via an RF (radio frequency) link, S-video link, composite video link, component video link, DVI (digital video interface), analog audio connection, or other similar communication link.

Although embodiments of a combined image and text document have been described in language specific to features and/or methods, it is to be understood that the subject of the appended claims is not necessarily limited to the specific features or methods described. Rather, the specific features and methods are disclosed as example implementations of a combined image and text document.

1. A method, comprising: generating a scanned image of a document utilizing a scanning application; generating text representations of text that is included in the document utilizing a character recognition application; correlating position data of the text representations with locations of corresponding text in the scanned image of the document; and rendering the scanned image for display overlaid with the text representations as a transparent overlay, the scanned image and the text representations being independently user-selectable for display.

2. A method as recited in claim 1, further comprising generating a markup document that includes the text representations, associated display alignment information, and a reference to the scanned image for display.

3. A method as recited in claim 1, further comprising receiving a user-selectable input to display at least one of the text representations without the scanned image, or the scanned image without the text representations.

4. A method as recited in claim 1, further comprising receiving a user-selectable input to display the text representations adjacent the scanned image.

5. A method as recited in claim 1, further comprising rendering the text representations as a visual overlay for display over the scanned image to illustrate a comparison of the text representations with the corresponding text in the scanned image that reflects an accuracy of the character recognition application.

6. A method as recited in claim 5, wherein the scanned image is displayed in a first color overlaid with the text representations displayed in a second color that contrasts with the first color to further illustrate the comparison of the text representations with the corresponding text in the scanned image.

7. A method as recited in claim 5, further comprising receiving a user-selectable input to display at least one of the text representations without the scanned image, or the scanned image without the text representations as the visual overlay.

8. A method as recited in claim 1, further comprising: searching for selected text in the text representations that are displayed in a transparent color over the scanned image; and highlighting the transparent overlay of the selected text to display an appearance that the corresponding text in the scanned image is highlighted.

9. A document representation system, comprising: a scanned image of a document generated by a scanning application; a character recognition application configured to generate text representations of text that is included in the document, and correlate position data of the text representations with locations of corresponding text in the scanned image of the document; and a display application configured to initiate a display of the scanned image overlaid with the text representations as a transparent overlay, the scanned image and the text representations being independently user-selectable for display.

10. A document representation system as recited in claim 9, wherein the display application is further configured to generate a markup document that includes the text representations, associated display alignment information, and a reference to the scanned image for display.

11. A document representation system as recited in claim 9, wherein the display application is further configured to receive a user-selectable input to initiate the display of at least one of the text representations without the scanned image, or the scanned image without the text representations.

12. A document representation system as recited in claim 9, wherein the display application is further configured to receive a user-selectable input to initiate the display of the text representations adjacent the scanned image.

13. A document representation system as recited in claim 9, wherein the display application is further configured to initiate the display of the text representations as a visual overlay for display over the scanned image to illustrate a comparison of the text representations with the corresponding text in the scanned image that reflects an accuracy of the character recognition application.

14. A document representation system as recited in claim 13, wherein the display application is further configured to initiate the display of the scanned image in a first color overlaid with the text representations displayed in a second color that contrasts with the first color to further illustrate the comparison of the text representations with the corresponding text in the scanned image.

15. A document representation system as recited in claim 9, wherein the display application is further configured to: receive a text search request for selected text; initiate a search for the selected text in the text representations that are displayed in a transparent color over the scanned image; and highlight the transparent overlay of the selected text to display an appearance that the corresponding text in the scanned image is highlighted.

16. One or more computer-readable media comprising computer-executable instructions that, when executed, direct a display application to: initiate display of a scanned image of a document generated by a scanning application; initiate display of text representations over the scanned image as a transparent overlay, the text representations generated from text that is included in the document and generated by a character recognition application; and receive a user-selectable input to initiate the display of at least one of the text representations without the scanned image, or the scanned image without the text representations.

17. One or more computer-readable media as recited in claim 16, further comprising computer-executable instructions that, when executed, direct the display application to initiate the display of the text representations as a visual overlay for display over the scanned image to illustrate a comparison of the text representations with the corresponding text in the scanned image that reflects an accuracy of the character recognition application.

18. One or more computer-readable media as recited in claim 17, further comprising computer-executable instructions that, when executed, direct the display application to initiate the display of the scanned image in a first color overlaid with the text representations displayed in a second color that contrasts with the first color to further illustrate the comparison of the text representations with the corresponding text in the scanned image.

19. One or more computer-readable media as recited in claim 17, further comprising computer-executable instructions that, when executed, direct the display application to receive the user-selectable input to initiate the display of the scanned image without the text representations as the visual overlay.

20. One or more computer-readable media as recited in claim 16, further comprising computer-executable instructions that, when executed, direct the display application to: receive a text search request for selected text; initiate a search for selected text in the text representations that are displayed in a transparent color over the scanned image; and highlight the transparent overlay of the selected text to display an appearance that the corresponding text in the scanned image is highlighted.